Inferring The Latent Structure of Human Decision-Making from Raw Visual Inputs
نویسندگان
چکیده
The goal of imitation learning is to mimic expert behavior without access to an explicit reward signal. Expert demonstrations provided by humans, however, often show significant variability due to latent factors that are typically not explicitly modeled. In this paper, we propose a new algorithm that can infer the latent structure of expert demonstrations in an unsupervised way. Our method, built on top of Generative Adversarial Imitation Learning, can not only imitate complex behaviors, but also learn interpretable and meaningful representations of complex behavioral data, including visual demonstrations. In the driving domain, we show that a model learned from human demonstrations is able to both accurately reproduce a variety of behaviors and accurately anticipate human actions using raw visual inputs. Compared with various baselines, our method can better capture the latent structure underlying expert demonstrations, often recovering semantically meaningful factors of variation in the data.
منابع مشابه
A Two-stage DEA Model Considering Shared Inputs, Free Intermediate Measures and Undesirable Outputs
Data envelopment analysis (DEA) has been proved to be an excellent approach for measuring the performance of decision-making units (DMUs) that use multiple inputs to generate multiple outputs. But the allocation problem of shared inputs and undesirable outputs does not arouse attention in this movement. This paper proposes a two-stage DEA model considering simultaneously the structure of shared...
متن کاملProvide a New Targeting Model in a Centralized Decision Making Environment with a Multi-Component Network Structure
This research seeks to develop resource and goals allocation planning models in a focused decision-making environment with a parallel multi-component network structure in a case study. In such an environment, the problem of resource and goals allocation planning, the determination of the input and output of each of the decision-making units in achieving the goals of the system is such that the ...
متن کاملRanking Network-Structured Decision-Making Units and Its Application in Bank Branches
Data envelopment analysis (DEA) is a method used for measuring the efficiency of decision-making units. Unlike the standard models, which assume decision-making units to be a black box, network data envelopment analysis focuses on the internal structure of these units. Some researchers have developed a two-stage method where all the inputs are entirely used in the first stage, producin...
متن کاملVisual Interaction Networks
From just a glance, humans can make rich predictions about the future state of a wide range of physical systems. On the other hand, modern approaches from engineering, robotics, and graphics are often restricted to narrow domains and require direct measurements of the underlying states. We introduce the Visual Interaction Network, a general-purpose model for learning the dynamics of a physical ...
متن کاملCommon weights for the evaluation of decision-making units with nonlinear virtual inputs and outputs
In this paper, by investigating the common weights concept and DEA models with nonlinear virtual inputs/outputs, we introduce a model for evaluating the decision making units with nonlinear virtual inputs and outputs based on the common weights.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1703.08840 شماره
صفحات -
تاریخ انتشار 2017